Compact dual-port DRAM architecture system and method for making same

ABSTRACT

The present invention provides a process integration technique which significantly reduces the array size of dual-port DRAM architecture systems. The array is reduced to a size which is significantly smaller than the array size of prior art DRAM architecture systems by using bitlines formed at half-pitch. The present invention also provides dual-port, open-bitline and folded-bitline DRAM arrays where each DRAM cell in the array has at least two vertically-oriented devices therein.

PRIORITY

This application is a continuation-in-part application from an application filed on Aug. 29, 2000 having U.S. application Ser. No. 09/650,011 pending at time of issue.

FIELD OF THE INVENTION

This invention relates to the field of integrated circuit (IC) design. Specifically, it relates to a compact dual-port dynamic random access memory (DRAM) architecture system and method for making same.

BACKGROUND OF THE INVENTION

Each memory cell in a dual-port static random access memory (SRAM) chip is a buffer or flip-flop, and data is retained as long as power is maintained to the chip. SRAMs are realized with a bipolar technology, such as TTL, ECL, or I²L or with MOS technology, such as NMOS or CMOS. Bipolar SRAMs are relatively fast, having access times of 10 to 100 nsec. Power dissipation is also high, typically, 0.1 to 1.0 mW/bit. By contrast, MOS RAM access time is typically 100 nsec and power dissipation is 25 μSW/bit. The combination of high circuit density, low power dissipation, and reasonable access time has led to the dominance of MOS technology in the manufacture of RAM. Hence, dual-port SRAMs having high-speed buffers are widely used in devices and equipment necessitating high-speed and high performance, such as microprocessors, communication networks, facsimile machines, modems, etc.

Since the memory cells of SRAMs take up a relatively large surface area on a single integrated (IC) chip, IC design engineers, in an effort to increase the number of memory cells on the IC chip and make the chip smaller, have focused on improving dynamic RAM (DRAM) chips to make them suitable for high-speed, high performance devices and equipment. Currently, the ultimate in compactness, is a single-port DRAM chip where each memory cell uses a capacitor to store a charge and one transistor to gate it to sense amplifier circuits as shown by the prior art DRAM cell 10 of FIG. 1.

The DRAM cell 10 includes an access transistor 12, a storage capacitor 14, a bitline 16 and a wordline 18. During a write access, a wordline enable signal is asserted on wordline 18 thereby turning on transistor 12. A data signal is provided on bitline 16. This signal is routed through transistor 12 and stored in capacitor 14. During a read access, a wordline enable signal is asserted on wordline 18 to turn on transistor 12. The data signal stored in capacitor 14 is routed to bitline 16 through transistor 12. This data signal is amplified by a sense amplifier circuit (not shown) and then provided to the device initiating the read access.

A disadvantage of the single-port DRAM cell 10 is that it does not enable multi-port access, where more than one port can be accessed for simultaneously enabling reading, writing and/or refreshing of the memory cell. Multi-port access is required if the DRAM chip is to compete with or surpass the SRAM chip in terms of high-speed and high performance, while being simple and compact. Further, the single-port DRAM cell 10 has two additional disadvantage common to all types of DRAM cells. That is, the charge of each DRAM cell must be restored after the cell is read, and the charge in every cell must be refreshed periodically by periphery refresh circuitry.

Hence, the data rate, in terms of data access time and refresh cycle time, is low for a DRAM chip which prevents IC design engineers from implementing DRAM chips in devices and equipment requiring high-speed and high performance, such as microprocessors and communication networks. It is therefore a goal of IC design engineers to design a dual-port DRAM architecture system which can simultaneously perform two data access requests slated for a DRAM cell for increasing the data array's data rate while maintaining or furthering its compactness. Such a DRAM architecture system would be a better design choice over an SRAM architecture system for devices and equipment necessitating high-speed and high performance.

A dual-port DRAM cell is described in U.S. Pat. No. 5,923,593. The dual-port DRAM cell as shown by FIG. 4 in the patent is designed for staggering the read accesses. That is, during the first half of a clock cycle the first port is accessed and during the second half of the clock cycle the second port is accessed. In a similar manner, write accesses can be staggered. That is, during the first half of the clock cycle, the first port is accessed for writing to a cell and during the second half of the clock cycle, the second port is accessed for writing to the same or a different cell. However, as noted in the patent, such a “simultaneous” write access results in an indeterminate data value being written to the DRAM cells which affects the integrity of the data.

For example, if in the first half of the clock cycle, a logic “one” is written into a cell while a logic “one” is latched into the sense amplifier circuit, the same row can be accessed through the second port during the second half of the clock cycle for writing a logic “zero”. At this moment, due to charge sharing between the first sense amplifier circuit and the second sense amplifier circuit, the resulting charge stored in that cell will be in between “one” and “zero” and the data in the cell has an indeterminate data value. Similarly, a read-write access could also result in the same situation. For example, if a DRAM cell is originally stored with a logic “zero”, after a read out operation through the first port during the first half of the clock cycle, a logic “zero” is latched in the first sense amplifier circuit. If the same row is accessed through the second port during the second half of the clock cycle, and is written with a logic “one”, the data in the cell will again have an indeterminate data value due to charge sharing between the first sense amplifier circuit and the second sense amplifier circuit.

Additionally, the dual-port DRAM architecture system described in U.S. Pat. No. 5,923,593 is the classical open-bitline architecture system which is known to be susceptible to noise problems due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate (or well) coupling which are well known in the DRAM industry. In an effort to mitigate the noise problems, the patent discloses the use of a dummy wordline swing in the adjacent array, as well as placing a dummy load in the edge array. However, with such a design configuration, the array size is significantly increased which undermines making the DRAM architecture system more compact. Additionally, operation of the dual-port DRAM becomes more complex due to having two arrays, rather than one array.

SUMMARY

An aspect of the present invention is to provide a dual-port DRAM architecture system for overcoming the disadvantages of the prior art.

Another aspect of the present invention is to provide a dual-port DRAM architecture system for simultaneously performing two access requests slated for a DRAM cell without affecting the integrity of the data.

Further, another aspect of the present invention is to provide a dual-port DRAM architecture system for prioritizing two simultaneous access requests slated for a DRAM cell.

Further still, another aspect of the present invention is to provide a dual-port DRAM architecture system for suppressing noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling.

Accordingly, in an embodiment of the present invention, a dual-port, folded-bitline DRAM architecture system is presented which prioritizes two simultaneous access requests slated for a DRAM cell of a data array prior to performing at least one of the access requests to prevent affecting the integrity of the data while suppressing noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling. If the two access requests are read-refresh, read-read or write-refresh, where the first access request is slated through a first port and the second access request is slated through a second port of a corresponding DRAM cell of the data array, the system prioritizes the access request slated through the first port at a higher priority than the access request slated through the second port. The system thus cancels the access request slated through the second port. If the two access requests are write-read, the system prioritizes the two access requests as being equal to each other. The system then simultaneously performs the write and read access by accessing the corresponding DRAM cell through the first port to write the data while simultaneously writing the data through to an output bus, which is equivalent to a read access.

In another embodiment of the present invention, a dual-port, shared-address bus DRAM architecture system is presented which can perform two access requests in a staggered manner. That is, in a first half of a clock cycle, a DRAM cell is accessed through a first port to perform a first access request and in a second half of the clock cycle, the DRAM cell is accessed through a second port to perform a second access request.

Similarly to the folded-bitline DRAM architecture system, this system also prioritizes two simultaneous access requests slated for the DRAM cell of a data array. If the two access requests are read-refresh, read-read or write-refresh, where the first access request is slated through the first port and the second access request is slated through the second port of a corresponding DRAM cell of the data array, the system prioritizes the access request slated through the first port at a higher priority than the access request slated through the second port. The system thus cancels the access request slated through the second port.

If the two access requests are write-read or read-write, the system prioritizes the two access requests as being equal to each other. The system then simultaneously performs the write and read access or the read and write access requests by accessing the corresponding DRAM cell through the first port or second port, respectively, to write the data while simultaneously writing the data through to an output bus, which is equivalent to a read access. If the two access requests are write-write, the system prioritizes the write access request slated through the second port at a higher priority than the write access request slated through the first port. The system thus cancels the write access request slated through the first port.

This system further includes shared-address buses, thereby enabling control circuitry to be shared by both ports, since only one port of the corresponding DRAM cell can be used at a time due to staggering access requests. Hence, less control circuitry is required and all of the control circuitry can be provided at one side of the data array. Accordingly, the fabrication cost and the amount of surface area used in implementing the system are decreased.

Prioritization is realized, in order to maintain data integrity in both DRAM architecture systems, by designating one port of each DRAM cell a master port and the other port a slave port, where the access request slated through the master port has typically a higher priority than the access request slated through the slave port. Accordingly, accesses to the DRAM cell through the master port, with some exceptions, take precedence over accesses through the slave port.

Each DRAM architecture system suppresses noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling by providing at least a complementary bitline on the data array for each true bitline to form bitline pairs.

The present invention provides a process integration technique which significantly reduces the array size of dual-port DRAM architecture systems. The array is reduced to a size which is significantly smaller than the array size of prior art DRAM architecture systems by using bitlines formed at half-pitch. The present invention also provides dual-port, open-bitline and folded-bitline DRAM arrays where each DRAM cell in the array has at least two vertically-oriented devices therein.

BRIEF DESCRIPTION OF THE FIGS.

FIG. 1 is a schematic diagram of a prior art single-port DRAM cell;

FIG. 2 is a diagram of a dual-port, folded-bitline DRAM architecture system according to the present invention;

FIG. 3A is a diagram of a first dual-port DRAM data array arrangement for the dual-port, folded-bitline DRAM architecture system of FIG. 2;

FIG. 3B is a diagram of a second dual-port DRAM data array arrangement for the dual-port, folded-bitline DRAM architecture system of FIG. 2;

FIG. 3C is an enlarged view of a portion of FIG. 3B;

FIG. 4 is a waveform diagram of the operation of a master and a slave port of the dual-port, folded-bitline DRAM architecture system of FIG. 2;

FIG. 5 is a schematic diagram of a slave port control circuit of the dual-port, folded-bitline DRAM architecture system of FIG. 2;

FIG. 6 is a schematic diagram of a master port control circuit of the dual-port, folded-bitline DRAM architecture system of FIG. 2;

FIG. 7 is a schematic diagram of a priority circuit of the dual-port, folded-bitline DRAM architecture system of FIG. 2;

FIG. 8 is a diagram of a dual-port, shared-address bus DRAM architecture system according to the present invention;

FIG. 9 is a waveform diagram of the operation of a master and a slave port of the dual-port, shared-address bus DRAM architecture system of FIG. 8;

FIG. 10 is a schematic diagram of a slave port control circuit of the dual-port, shared-address bus DRAM architecture system of FIG. 8;

FIG. 11 is a schematic diagram of a master port control circuit of the dual-port, shared-address bus DRAM architecture system of FIG. 8;

FIG. 12 is a schematic diagram of a priority circuit of the dual-port, shared-address bus DRAM architecture system of FIG. 8;

FIG. 13 is a diagram of an open-bitline, dual-port DRAM cell according to the present invention;

FIG. 14 is a diagram of an open-bitline, dual-port DRAM cell array layout according to the present invention;

FIG. 15 is a cross sectional view of the open-bitline, dual-port DRAM cell shown by FIG. 14 taken along line A—A in FIG. 14;

FIG. 16 is a diagram of a folded-bitline, dual-port DRAM cell of a data array having a half-pitch bitline layout according to a first embodiment of the present invention;

FIG. 17 is a cross-sectional view of the folded-bitline, dual-port DRAM cell shown by FIG. 16 taken along line B—B in FIG. 16;

FIG. 18 is a cross-sectional view of the top surface of the folded-bitline, dual-port DRAM cell shown by FIG. 16 taken along line C—C in FIG. 16;

FIG. 19 is a cross-sectional view of a prior art open-bitline, dual-port DRAM cell;

FIG. 20 is a diagram illustrating a top view of the half-pitch bitline layout of the data array having the DRAM cell shown by FIG. 16;

FIG. 21 is a diagram illustrating the folded-bitline, dual-port DRAM cell array layout according to the first embodiment of the present invention;

FIG. 22 is a diagram illustrating a folded-bitline, dual-port DRAM cell array layout according to a second embodiment of the present invention;

FIG. 23 is a diagram illustrating a folded-bitline, dual-port DRAM cell array layout according to a third embodiment of the present invention;

FIG. 24 is a diagram illustrating a folded-bitline, dual-port DRAM cell array layout according to a fourth embodiment of the present invention;

FIG. 25 is a diagram illustrating a folded-bitline, dual-port DRAM cell array layout according to a fifth embodiment of the present invention; and

FIG. 26 is a diagram illustrating a folded-bitline, dual-port DRAM cell array layout according to a sixth embodiment of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a dual-port, folded-bitline DRAM architecture system which prioritizes two simultaneous access requests slated for a DRAM cell of a data array prior to performing at least one of the access requests to prevent affecting the integrity of the data while suppressing noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling. If the two access requests are read-refresh, read-read or write-refresh, where the first access request is slated through a first port and the second access request is slated through a second port of a corresponding DRAM cell of the data array, the system prioritizes the access request slated through the first port at a higher priority than the access request slated through the second port. The system thus cancels the access request slated through the second port. If the two access requests are write-read, the system prioritizes the two access requests as being equal to each other. The system then simultaneously performs the write and read access by accessing the corresponding DRAM cell through the first port to write the data while simultaneously writing the data through to an output bus, which is equivalent to a read access.

In another embodiment of the present invention, a dual-port, shared-address bus DRAM architecture system is presented which can perform two access requests in a staggered manner. That is, in a first half of a clock cycle, a DRAM cell is accessed through a first port to perform a first access request and in a second half of the clock cycle, the DRAM cell is accessed through a second port to perform a second access request.

Similarly to the folded-bitline DRAM architecture system, this system also prioritizes two simultaneous access requests slated for the DRAM cell of a data array. If the two access requests are read-refresh, read-read or write-refresh, where the first access request is slated through the first port and the second access request is slated through the second port of a corresponding DRAM cell of the data array, the system prioritizes the access request slated through the first port at a higher priority than the access request slated through the second port. The system thus cancels the access request slated through the second port.

If the two access requests are write-read or read-write, the system prioritizes the two access requests as being equal to each other. The system then simultaneously performs the write and read access or the read and write access requests by accessing the corresponding DRAM cell through the first port or second port, respectively, to write the data while simultaneously writing the data through to an output bus, which is equivalent to a read access. If the two access requests are write-write, the system prioritizes the write access request slated through the second port at a higher priority than the write access request slated through the first port. The system thus cancels the write access request slated through the first port.

This system further includes shared-address buses, thereby enabling control circuitry to be shared by both ports, since only one port of the corresponding DRAM cell can be used at a time. Hence, less control circuitry is required and all of the control circuitry can be provided at one side of the data array. Accordingly, the fabrication cost and the amount of surface area used in implementing the system are decreased.

Prioritization is realized, in order to maintain data integrity in both DRAM architecture systems, by designating one port of each DRAM cell a master port and the other port a slave port, where the access request slated through the master port has typically a higher priority than the access request slated through the slave port. Accordingly, accesses to the DRAM cell through the master port, with some exceptions, take precedence over accesses through the slave port.

Each DRAM architecture system suppresses noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling by providing at least a complementary bitline on the data array for each true bitline to form bitline pairs.

In a first section of this disclosure, the dual-port, folded-bitline DRAM architecture system is described. In a subsequent section, the dual-port, shared-address bus DRAM architecture system is described.

I. Dual-port, Folded-Bitline DRAM Architecture System

A. System Configuration and Suppressing Noise Due to the Coupling Effect

Unlike the prior art dual-port DRAM architecture systems, the dual-port, folded-bitline DRAM architecture system of the present invention is capable of suppressing noise due to several reasons. First, when one wordline is activated, it couples up both true and complementary bitlines, i.e., a pair of bitlines, by the same amount. This is due to the fact that both true and complementary bitlines have identical loads and identical wordline-to-bitline coupling capacitance. As a result, the voltage level difference between the true and complementary bitlines does not change by the coupling effect during operation of the dual-port, folded-bitline DRAM architecture system, which prevents noise from building up due to wordline-to-bitline coupling.

Second, the dual-port, folded-bitline DRAM architecture system of the present invention, completely cancels out bitline-to-bitline coupling by providing a bitline-twist configuration as known in the art for a single-port DRAM architecture system, since under all circumstances, any bitline will always have its paired twisted-bitline(s) to completely cancel the bitline-to-bitline coupling effect.

On the contrary, in the open-bitline DRAM architecture system of the prior art, there is no canceling out of the bitline-to-bitline coupling effect, because there is no pairing of bitlines. In the open-bitline system, the worst case is when there is one falling bitline among two adjacent rising bitlines, or, one rising bitline among two adjacent falling bitlines. This single rising or falling bitline experiences the worst bitline-to-bitline coupling effect. The data signal transmitted via this bitline is thereby, significantly weakened.

Third, during operation of the dual-port, folded-bitline DRAM architecture system, and especially, during signal amplification by the sense amplifier circuits, one bitline of the pair of bitlines always rises up to the supply voltage or Vdd and the other bitline of the pair of bitlines falls down to ground. At this moment, the over-all bitline-to-substrate coupling effect is theoretically canceled out.

On the contrary, in the open-bitline DRAM architecture system of the prior art, there is no canceling out of the bitline-to-substrate coupling effect, because there is no pairing of bitlines. The worst bitline-to-substrate coupling effect case for the open-bitline DRAM architecture system is when the whole array of bitlines rises up to Vdd, and thus the substrate of the active array is coupled up. This causes unnecessary cell leakage caused by threshold lowering of the transfer devices due to substrate voltage rising within the substrate which gives rise to noise.

To counteract the bitline-to-substrate coupling effect, the dual-port, open-bitline DRAM architecture system disclosed by U.S. Pat. No. 5,923,593 provides an adjacent reference or dummy array. As the substrate of the active array is coupled up, the substrate of the adjacent reference array is coupled down to cancel out the overall bitline-to-substrate coupling effect. This design configuration requires more components which increase the fabrication cost and the surface area utilized.

With reference to FIG. 2, there is shown the dual-port, folded-bitline DRAM architecture system of the present invention, which is designated generally by reference numeral 100. Each DRAM cell of the system 100, including the DRAM cell 160 shown by FIG. 2, is a dual-port DRAM cell, i.e., it contains a master port 102 and a slave port 104. Since each DRAM cell is a dual-port DRAM cell, it contains two transfer transistors N1 and N2, and one storage capacitor C1.

The components for driving the master port 102 are located to the left of a data array 106. The components for driving the slave port 104 are located to the right of the data array 106. The master port components are a master row decoder 108, a master wordline driver 110, a master column decoder 112, a master I/O switch 114, a master sense amplifier circuit 116, a master row/column address buffer 118, a master port control circuit 120, and an I/O buffer 122. The structure and operation of all of the components of the master port 102 and their inter-connections are known in the art, except for the master port control circuit 120 which controls access through the master port 102. The master port control circuit 120 is described in detail below with reference to FIG. 6.

The slave port components are a slave row decoder 128, a slave wordline driver 130, a slave column decoder 132, a slave I/O switch 134, a slave sense amplifier circuit 136, a slave row/column address buffer 138, a slave port control circuit 140, and an output buffer 142. Unlike the master port 102 which is used to perform read and write accesses and hence, warrants the need for an I/O buffer, the slave port 104 is only used to perform refresh and read accesses, and therefore, only an output buffer is needed for the slave port 104. The structure and operation of all of the components of the slave port 104 and their inter-connections are known in the art, except for the slave port control circuit 140 which controls access through the slave port 104. The slave port control circuit 140 is described in detail below with reference to FIG. 5.

The system 100 further includes a refresh address generator 150 which generates a slave refresh address. The slave refresh address is transmitted to the slave row/column address buffer 138. The slave refresh address indicates typically the row address of the DRAM cell which requires refreshing, in order for that DRAM cell to be refreshed through the slave port 104.

One refresh method is a burst refresh. A refresh counter of the refresh address generator 150 activates a wordline at a time to refresh the DRAM cells in a particular row of the array 106 until all the DRAM cells in the array 106 are refreshed as known in the art. The counter then waits for a refresh clock to clock a subsequent refresh cycle.

The burst refresh access can be described as being hidden, since the refresh address generator 150 is not controlled by external circuitry, such as an external processor. It is mainly designed to perform independent refresh accesses to restore charges after a read access or after the charges have leaked off from the DRAM cells.

Hence, the external processor is unaware when a refresh access is in process through the slave port 104. The external processor is only programmed to be aware that only the master port 102 of each DRAM cell in the data array 106 is always available for read and write accesses, and the slave port 104 is sometimes available for read accesses.

When the slave port 104 is not being operated in a refresh mode, the slave refresh address is transmitted to the slave row/column address buffer 138 and it is stored therein. When the system 100 operates in the refresh mode, the slave refresh address is read and the DRAM cell corresponding to the slave refresh address is refreshed through the slave port 104, as further described below.

A slave port control signal RAS determines the operating mode of the slave port 104, i.e., whether the slave port 104 is to be operated in the refresh mode or read mode, as described below. The slave port control signal RAS is generated and transmitted to the slave port control circuit 140 by external circuitry, such as a memory controller or processor.

The system 100 further includes master bitlines BLM and slave bitlines BLS coupling the master port 102 and slave port 104, respectively, of each dual-port DRAM cell (one DRAM cell 160 is shown by FIG. 2) of the data array 106 to the master sense amplifier circuit 116 and the slave sense amplifier circuit 136, respectively. Each bitline includes an adjacent, non-active bitline for realizing the folded-bitline architecture to suppress noise due to the coupling effect.

Two pairs of complementary bitlines BLS are shown by FIG. 2, i.e., BLM(i), BLM(i+1) and BLS(i), BLS(i+1). The first two bitlines BLM(i), BLM(i+1) are the master bitlines and are grouped as the first pair feeding into the master sense amplifier circuit 116, while the second two bitlines BLS(i), BLS(i+1) are the slave bitlines and are grouped as the second pair feeding into the slave sense amplifier circuit 136.

Further, the system 100 includes a corresponding master wordline WLM and slave wordline WLS connecting the dual-port DRAM cell 160 of the data array 106 to the master wordline driver 110 and the slave wordline driver 130, respectively, as known in conventional DRAM circuit design. The master port 102 is accessed from the first wordline WLM while the slave port 104 is accessed from the second wordline WLS according to traffic control rules of the present invention outlined in the following sub-section. Finally, the system 100 includes data input and data output lines 152, 154, in order for the system 100 to communicate with external devices by receiving and transmitting data as known in the art.

With reference to FIG. 3A, the grouping of the bitlines into pairs in accordance with the present invention is further illustrated as the data array 106 is shown in greater detail. The data array 106 includes four rows and N columns for a total of 4×N DRAM cells. It is contemplated that the data array 106 can have any number of rows and columns of DRAM cells with the appropriate number of wordlines and bitlines.

The first pair of bitlines BLM(l), BLM(2), the third pair of bitlines BLM(3), BLM(4), and the N−1th pair of bitlines BLM(N−1), BLM(N) are master bitline pairs; the second pair of bitlines BLS(1), BLS(2), the fourth pair of bitlines BLS(3), BLS(4), and the Nth pair of bitlines BLS(N−1), BLS(N) are slave bitline pairs. Each of the master bitline pairs are fed to a corresponding amplifier AMP_M1, AMP_M2 AMP_MN within the master sense amplifier circuit 116; the slave bitline pairs are fed to a corresponding amplifier AMP_S1, AMP_S2, AMP_SN within the slave sense amplifier circuit 136. The data array 106 further includes four master wordlines WLM1-WLM4 and four slave wordlines WLS1-WLS4.

Further, each DRAM cell 200A-203N has a pair of parasitic capacitors, such as C2 and C3 shown for DRAM cell 200A. One of the parasitic capacitors connects a true bitline to a wordline and the other parasitic capacitor connects the true bitline's complementary bitline to the same wordline. Hence, the parasitic capacitors are the wordline-to-bitline pair coupling capacitors. The true and complementary bitlines have approximately equal loads, i.e., the bitlines are connected to the same number of devices, and approximately equal wordine-to-bitline coupling capacitance. The preferred capacitance range for the parasitic capacitors C2, C3 is approximately 10 fentofarads.

Hence, it has been demonstrated that the parasitic capacitors C2, C3 prevent the voltage level difference between the true and complementary bitlines from changing, i.e., the voltage level of the true bitline is increased while the voltage level of the complementary bitline is decreased, due to the wordline-to-bitline coupling effect, thereby suppressing noise which is created due to this effect. That is, for example, when the wordline WLM1 is activated for performing an operation within DRAM cell 200A as known in the art, the wordline WLM1 couples up, i.e., brings to a higher voltage level, both the true bitline BLM(1) and its complementary bitline BLM(2) by the same voltage amount due to an increase in the voltage potential of the parasitic capacitors C2, C3, thereby the net effect due to wordline-to-bitline coupling is canceled. As a result, noise created by the wordline-to-bitline coupling effect is significantly suppressed.

In the open-bitline DRAM architecture system of the prior art, when a wordline's voltage level rises during signal development, one of the bitlines is coupled up, i.e., its voltage level is increased. However, its complementary bitline which is connected to the dummy array is coupled down, i.e., its voltage level is decreased. Accordingly, noise created by the wordline-to-bitline coupling effect is not suppressed in the open-bitline system.

Noise created by the bitline-to-substrate coupling effect during a signal sensing period has also been shown to be significantly suppressed by the dual-port, folded-bitline DRAM architecture system 100 of the present invention. During signal amplification, within a pair of bitlines, by an amplifier within the master sense amplifier circuit 116 and the slave sense amplifier circuit 136, the true bitline will always rise up to Vdd and its complementary bitline will fall down to ground. Hence, when the true bitline rises to Vdd, it couples the substrate up to Vdd due to bitline-to-substrate capacitance, i.e., a voltage potential between the true bitline and ground. Similarly, when its complementary bitline drops to ground, it couples the substrate down to ground due to bitline-to-substrate capacitance, and the overall bitline-to-substrate coupling effect is canceled out. As a result, noise created by the bitline-to-substrate coupling effect is significantly suppressed.

It is noted that sometimes the bitline will not rise up to Vdd, but to a voltage level less than Vdd; also, sometimes the other bitline will not fall down to ground, but to a higher voltage level than ground. However, for simplicity, Vdd and ground are used herein.

Further, each pair of bitlines has at least a pair of parasitic capacitors, such as C4 and C5 shown for bitline pair BLM(1), BLM(2), having approximately the same capacitance. The preferred capacitance for the parasitic capacitors C4 and C5 is in the range of 30-60 fentofarads. One of these parasitic capacitors reflects the capacitance of the true bitline and the other parasitic capacitor reflects the capacitance of the complementary bitline.

With reference to FIG. 3B, in order to cancel the net effect due to bitline-to-bitline coupling, the bitlines need to be arranged in a bitline-twist configuration as illustrated by dual-port, folded-bitline DRAM data array 106A. Similarly to parasitic capacitors C4 and C5 for DRAM array 106, parasitic capacitors C4′ and C5′ for DRAM array 106A have equal capacitance. Therefore, as it has been demonstrated, each true bitline will be either rising or falling during a DRAM cell operation and its complementary bitline will be doing the opposite, thereby the net effect due to bitline-to-bitline coupling is canceled. As a result, noise created by the bitline-to-bitline coupling effect is significantly suppressed.

For example, as shown by FIG. 3C which depicts a portion of DRAM array 106A of FIG. 3B, as the true bitline rises to approximately Vdd during operation of the memory system, its complementary bitline drops to ground, and vice versa, thereby canceling any bitline-to-bitline coupling. Consequently, noise attributed to bitline-to-bitline coupling is suppressed.

The bitline is either brought up to approximately Vdd from ½ Vdd or brought down to ground from ½ Vdd during the signal sensing period due to at least a pre-charge circuit within the master and slave sense amplifiers 116A, 136A, as known in the art. Accordingly, the voltage potential, i.e., parasitic capacitance, between each pair of bitlines, for example, between true bitline BLS(1) and its complementary bitline BLS(2) in FIG. 3C at point A during signal development, rises from ½ Vdd to approximately Vdd and also drops from ½ Vdd to ground, for example, at point B in FIG. 3C. Hence, bitline-to-bitline coupling is canceled, thereby suppressing noise due to such coupling.

The data array 106A includes four rows and N columns for a total of 4×N DRAM cells. It is contemplated that the data array 106A can have any number of rows and columns of DRAM cells with the appropriate number of wordlines and bitlines.

The second pair of bitlines BLM(1), BLM(2), the fourth pair of bitlines BLM(3), BLM(4), and the Nth pair of bitlines BLM(N−1), BLM(N) are master bitline pairs; the first pair of bitlines BLS(1), BLS(2), the third pair of bitlines BLS(3), BLS(4), and the N−1th pair of bitlines BLS(N−1), BLS(N) are slave bitline pairs. Each of the master bitline pairs are fed to a corresponding amplifier AMP_M1, AMP_M2, AMP_MN within the master sense amplifier circuit 116A; the slave bitline pairs are fed to a corresponding amplifier AMP_S1, AMP_S2, AMP_SN within the slave sense amplifier circuit 136A. The data array 106A further includes four master wordlines WLM1-WLM4 and four slave wordlines WLS1-WLS4.

It is contemplated that each bitline is twisted more than once for DRAM arrays having more than four rows. For example, for a DRAM array having six rows, each bitline will be twisted two times, and for a DRAM array having ten rows, each bitline will be twisted four times.

Similarly to DRAM array 106, each DRAM cell 200A-203N has a pair of parasitic capacitors, such as C2′ and C3′ shown for DRAM cell 200A. One of the parasitic capacitors connects a true bitline to a wordline and the other parasitic capacitor connects the true bitline's complementary bitline to the same wordline. Hence, as described above, these parasitic capacitors are the wordline-to-bitline pair coupling capacitors. The true and complementary bitlines have approximately equal loads, i.e., the bitlines are connected to the same number of devices, and approximately equal wordine-to-bitline coupling capacitance. The preferred capacitance range for the parasitic capacitors C2′, C3′ is approximately 10 fentofarads.

As indicated above, each pair of bitlines also has a pair of parasitic capacitors, i.e., capacitors C4′ and C5′ shown for bitline pair BLS(1), BLS(2), having approximately the same capacitance. The preferred capacitance for the parasitic capacitors C4′ and C5′ is in the range of 30-60 fentofarads. One of these parasitic capacitors reflects the capacitance of the true bitline and the other parasitic capacitor reflects the capacitance of the complementary bitline.

It is preferred that the system 100 includes the DRAM array 106A for maximum suppression of noise due to the coupling effect, since DRAM array 106A is designed to suppress noise due to wordline-to-bitline coupling and bitline-to-substrate coupling as described with reference to FIG. 3A, as well as bitline-to-bitline coupling, whereas DRAM array 106 is designed to suppress noise due to wordline-to-bitline coupling and bitline-to-substrate coupling only.

Additionally, each wordline, in both DRAM arrays 106 and 106A, includes a parasitic capacitor, such as C6 for DRAM array 106 and C6′ for DRAM array 106A for wordline WLM1. The parasitic capacitors C6 and C6′ represent the wordline capacitance or voltage potential between a particular wordline and ground. The parasitic capacitors C6 and C6′ have a preferred capacitance of approximately one picofarad. It is noted that all of the parasitic capacitors, i.e., C2-C6 for DRAM array 106 and C2′-C6′ for DRAM array 106A, are realized due to wiring, gate oxide of the devices, etc.

Additionally, the system 100 includes a comparator 156, as further described below, for determining simultaneous access requests through the master port 104 and slave port 102 of a DRAM cell within the data array 106, such as the DRAM cell 160.

B. Maintaining Data Integrity

A description will now be made as to the operation of the dual-port, folded bitline DRAM architecture system 100 with reference to FIGS. 4-7, and especially how the system 100 maintains data integrity. In order to maintain data integrity, the accessing of the same wordline from two ports of a DRAM cell should be avoided. However, in theory, for a dual-port DRAM, both ports should be freely accessible to improve performance and data processing speed. The key of the present invention is a built-in traffic control logic system to handle any possible address collisions which occur when an attempt is made to simultaneously access a DRAM cell through both ports.

With reference to the waveform diagrams of FIG. 4, during the refresh mode, e.g., at a time when a burst (or distributed refresh) access occurs, the slave port 104 is not available. When the slave port 104 is not in the refresh mode, the slave port 104 becomes available for a read access only. A read or write access can be performed by the master port 102 at any time. A write access can only be performed through the master port 102.

Accordingly, the dual-port, folded bitline DRAM architecture system 100 can be described as behaving like an SRAM architecture system, because like the SRAM system, one port is always randomly accessible for a read or write access, while a refresh access is hidden, i.e., the refresh access can be performed at any time without requiring a specific refresh request from external circuitry, such as an external processor. However, the performance of the dual-port, folded bitline DRAM architecture system 100 is generally better than the single-port SRAM architecture system. For example, if the cycle time of the single-port SRAM is less than two times faster than the dual-port, folded-bitline DRAM architecture system 100, then by using the dual-port DRAM system 100 the data rate will be equivalent or better than using the single-port SRAM.

The master port and slave port control circuits 120, 140 for the dual-port, folded-bitline DRAM architecture system 100 are designed to operate according to the following four traffic rules:

1. If a DRAM cell is accessed via a wordline through either the master port 102 or slave port 104 for a read access, then following the read access, the DRAM cell is automatically refreshed. Therefore, there is no need to perform a refresh access for that particular wordline anymore, and a simultaneous refresh request slated through the slave port 102 is canceled. Also, if the DRAM cell is accessed via the wordline through the master port 102 for a write access, a simultaneous refresh request slated through the slave port 104 on the same wordline is also canceled;

2. If the DRAM cell is accessed via two ports of the same row address wordlines for simultaneously performing write and read operations based upon a write access request slated through the master port 540 and a read access request slated through the slave port 536, it is always desirable, that the most recent data written in the DRAM cell is read out. Therefore, the data is written to the DRAM cell through the master port 102, since, as indicated above, data can only be written to the DRAM cell through the master port 102. While the data is being written to the DRAM cell, the system 100 writes the data through to the output data bus 154 via at least the I/O buffer 122 which is equivalent to a read operation. That is, as the data is received by the I/O buffer 122 via the input data bus 152, if the data is to be written to the output data bus 154, the I/O buffer 122 transmits the data to the output data bus 154 while simultaneously transmitting the data to the DRAM cell. Accordingly, the data is simultaneously written to the DRAM cell and to the data bus 154 to satisfy the write and read requests. Since the read request is satisfied, the read request slated through the slave port 104 is canceled;

3. If the DRAM cell is accessed via two ports of the same row address wordlines for two simultaneous read operations, the read request slated through the slave port 104 is ignored, since one read operation is adequate; and

4. Since a write access is not possible through the slave port 104 (a write access can only be performed through the master port 102), there can be no write-write address collisions.

The two control circuits 120, 140 for the master port 102 and the slave port 104, respectively, are shown schematically by FIGS. 5 and 6. The slave port control circuit 140 has five input nodes for receiving five different input signals. These signals include a write request signal W, a read request signal R, a refresh request signal F, the slave port control signal RAS, and a match signal M. The write and read request signals are received from external circuitry, such as a processor, for performing write and/or read operations. The slave port control signal RAS, as indicated above, is also received from external circuitry, such as a processor.

The match signal M is provided by the comparator 156 of a priority circuit as described below with reference to FIG. 7. The slave port control circuit 140 also has one output node for transmitting a slave port output signal SS to slave port circuitry, such as the slave row decoder 128 and slave wordline driver 130, for allowing or canceling access to the slave port 104, in accordance with the logic value of the match signal M. The slave row decoder 128 is connected to the slave wordline driver 130 for driving the slave port wordlines (WLS), as known in conventional DRAM circuit design.

With reference to FIG. 6, the master port control circuit 120 includes three input nodes for receiving the write request signal W, the read request signal R, and the match signal M. The master port control circuit 120 further includes an output node for transmitting a write-through signal WTS to master control circuitry, such as the master row decoder 108, the master wordline driver 110, and the I/O buffer 122, for simultaneously performing a write and read access. The master row decoder 108 is connected to the master wordline driver 110 for driving the master port wordlines (WLM), as known in conventional DRAM circuit design.

The operation of the master port and slave port control circuits 120, 140 will now be described with continued reference to FIGS. 5 and 6, and to the priority circuit 170 shown by FIG. 7 which includes the comparator 156, and the master port and slave port control circuits 120, 140. The priority circuit 170 uses three task request signals, i.e., the write request signal W, the read request signal R, and the refresh request signal F, to determine whether the slave port output signal SS outputted by the slave port control circuit 140 and the write-through signal WTS outputted by the master port control circuit 120 should be a logic high or a logic low.

The slave port control circuit 140 is designed to transmit a logic high slave port output signal SS to slave port control circuitry, such as the slave row decoder 128 and slave wordline driver 130, when the master and slave row addresses inputted into the comparator 156 are different to allow simultaneous access to both the master port 102 and slave port 104. Further, the slave port control circuit 140 is designed to transmit a logic low slave port output signal SS when the row addresses selected by the master row decoder 108 and slave row decoder 128 are identical or match, since the task requested through the master port 102 has a higher or equal priority to the task requested through the slave port 104 according to the traffic rules.

The master and slave row addresses are extracted from master and slave addresses received by the master row/column address buffer 118 and the slave row/column address buffer 138, respectively, from external circuitry, such as a processor, as known in the art. For example, when the external circuitry requires access to the DRAM data array 106 to write and/or read data to and from a memory cell therein.

Accordingly, after the comparator 156 of the priority circuit 170 compares the master and slave row addresses, a logic high match signal M is transmitted to the master port and slave port control circuits 120, 140, if the row addresses are identical, and the slave port control signal RAS is disabled. As a result, the slave port control circuit 140 transmits a logic low slave port output signal SS to the slave port circuitry to cancel access to the slave port 104, and hence, cancel the task scheduled to be performed through the slave port 104 to avoid data integrity problems. Access to the master port 102 is never canceled, and hence, the task scheduled to be performed through the master port 102 is never canceled. This task is thereby subsequently performed.

In essence, the priority circuit 170, shown by FIG. 7, prioritizes the tasks scheduled to be performed, where the tasks scheduled to be performed through the master port 102 are given a higher priority over the tasks scheduled to be performed through the slave port 104 when the row addresses are identical. However, if the task scheduled to be performed through the master port 102 is a write access and the task scheduled through the slave port 104 is a read access, then the 10 tasks are performed simultaneously by writing to the DRAM cell 160 through the master port 102 and simultaneously writing through to the output data bus 154. The read access request slated through the slave port 104 is canceled.

At the slave port control circuit 140, the write request signal W is inputted through a first inverter INV1. The output of the inverter INV1 is inputted to a first and a second AND gate, AND1, AND2. The output of the first AND gate, AND1, is inputted to a top input of multiplexer MUX1. The read request signal R is inputted to the first AND gate, AND1, a bottom input of the multiplexer MUX1, and a second inverter INV2. The output of the second inverter INV2 is inputted to the second AND gate, AND2. The output of the second AND gate, AND2, is inputted to a top input of the multiplexer MUX2. The refresh request signal F is inputted to the second AND gate, AND2, and a bottom input of the multiplexer MUX2.

The output of the multiplexer MUX1 is inputted to a third AND gate, AND3. The output of the multiplexer MUX2 is inputted to a fourth AND gate, AND4. The slave port control signal RAS is inputted to the third and fourth AND gates, AND3, AND4. The output of the third and fourth AND gates, AND3, AND4 is inputted to an OR gate, which outputs the slave port output signal SS.

With respect to the master port control circuit 120, as shown by FIG. 6, the write and read request signals are inputted to an AND gate, AND. The output of the AND gate, AND, is inputted to a top input of a multiplexer MUX. A bottom input of the multiplexer MUX, is tied to ground. The output of the multiplexer MUX, is the write-through signal WTS.

As described above, based on the logic level of the match signal M, the output of the multiplexers MUX, MUX1, MUX2 is either the top input or the bottom input. The three multiplexers MUX, MUX1, MUX2 receive the match signal M transmitted by the comparator 156. Hence, the match signal M becomes a control signal for the multiplexers MUX, MUX1, MUX2. If the match signal M is a logic high signal, i.e., the row addresses are identical, the top input of multiplexers MUX, MUX1, MUX2 is outputted. If the match signal M is a logic low signal, i.e., the row addresses are not identical, the bottom input of multiplexers MUX1, MUX2 is outputted.

Table 1 shows the input configurations for the write request signal W, the read request signal R, and the refresh request signal F, and the outputs of the multiplexers MUX1, MUX2 (including the output of the multiplexer MUX of the master port control circuit 120, i.e., the write-through signal WTS) for a logic high and a logic low match signal M. Table 1 also shows the logic level of the slave port control signal RAS and the slave port output signal SS.

TABLE 1 Write, read and refresh request input configurations, multiplexer outputs, and logic levels of the slave port control signal RAS and the slave port output signal SS. MUX1 MUX2 MUX M W R F Output Output Output RAS SS WTS 0 1 0 1 0 1 0 1 1 0 0 1 1 0 1 0 0 1 1 0 1 1 0 1 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 1

It is noted that since a read request and a refresh request cannot be simultaneously performed by the slave port 104, when the read request signal R is a logic high, the refresh request signal F is a logic low, and vice versa. Also, it can be observed from Table 1 that when the match signal M is a logic low signal, the slave port control signal RAS is enabled, and when the match signal M is a logic high signal, the slave port control signal RAS is disabled to output a logic high or a logic low slave port output signal SS, respectively. That is, if the master and slave row addresses are not identical, the task requested through the slave port 104 and the task requested through the master port 102 are performed simultaneously. If the master and slave row addresses are identical, the task requested through the master port 102 takes precedence, unless it is a write-read request, where both tasks are performed simultaneously due to a high write-through signal WTS, as described below.

Table 1 does not show any input configurations where only one of the request signals is a logic high signal, because if this occurs, there can be no address collisions and the master and slave port control circuits 120, 140 are not accessed.

In the case of a write-read request as shown by the last row of Table 1, i.e., when both a write and a read request are simultaneously requested via the same row address to be performed through the master and slave ports 102, 104, respectively, the master port control circuit 120, shown by FIG. 6, outputs a high write-through signal WTS. This is accomplished by inputting the write request signal W and the read request signal R to the AND gate, AND. Since the write request signal W and the read request signal R are logic high signals when a write-read access request is made, the output of the AND gate, AND, is a logic high signal. Since the match signal M is a logic high signal, the top input of the multiplexer MUX is outputted, i.e., a logic high write-through signal WTS, to system control circuitry, in order for the system 100 to simultaneously perform both access requests. As described above, the system 100 is designed to write the data to the DRAM cell 160 and simultaneously write the data through to the output data bus 154 to simultaneously perform both access requests.

If the row address are not identical, the comparator 156 outputs a logic low match signal M to the multiplexer MUX and the multiplexer MUX outputs the bottom input. Since the bottom input of the multiplexer MUX is tied to ground, the output of the multiplexer MUX is a logic low signal, i.e., a logic low write-through signal WTS is outputted, as shown by the first and second rows of Table 1.

In conclusion, since the dual-port, folded-bitline DRAM architecture system 100 inputs the task request signals and the slave port control signal RAS within the priority circuit 170, when an address collision occurs, the system 100 maintains data integrity, while maintaining a smooth, simultaneous dual-port operation.

II. Dual-port, Shared-address Bus DRAM Architecture System

A. System Configuration and Suppressing Noise Due to the Coupling Effect

The dual-port, shared-address bus DRAM architecture system will now be described with reference to FIGS. 8-12. The overall system is shown by FIG. 8 and designated generally by reference numeral 500. The system 500 is similar in structure and operation to the dual-port folded-bitline architecture system 100, except for the fact that the master and slave row decoders 108, 128, the master and slave wordline drivers 110, 130, the master and slave port circuits 120, 140, and the master and slave row/column address buffers 118, 138 have been combined in order for the address buses within these components to be shared as known in DRAM circuit design and as described below with respect to the master and slave port circuits. Hence, less control circuitry is required and all of the control circuitry can be provided at one side of a data array 502. Accordingly, the fabrication cost and the amount of surface area used in implementing the system 500 are decreased.

The combined components include a master/slave port control circuit 504 having a master port control circuit 506 (FIG. 11) and a slave port control circuit 508 (FIG. 10), a master/slave row decoder 510, a master/slave wordline driver 512, and a master/slave prefetch address buffer 514 capable of receiving a slave refresh address from a refresh address generator 516, and a slave address and a master address from external circuitry, such as an external processor.

The components of the system 500 which are analogous in structure and operation to components of the system 100, and prior art DRAM architecture systems, are a master column decoder 518, a master I/O switch 520, a master sense amplifier 522, a slave column decoder 524, a slave I/O switch 526, a slave sense amplifier 528, and an I/O buffer 530. Another I/O buffer 532 is included for the slave ports which is different than the output buffer 142, since, unlike system 100, data can be written to a DRAM cell 534 of system 500 through its slave port 536.

However, similarly to system 100, data can be refreshed and read through the slave port 536 and data can be written and read through a master port 540 of each DRAM cell. The data is received by and transmitted from the DRAM cell 534 via the two I/O buffers 530, 532 and input/output data buses 542, 544.

Since each DRAM cell is a dual-port DRAM cell, as shown by the DRAM cell 534, each one contains, besides the master and slave ports 540, 536, two transfer transistors N1 and N2, and one storage capacitor C1.

The system 500 further includes a comparator 546 for receiving a slave row address and a master row address and comparing the addresses to determine if they are identical. As further described below with reference to FIG. 12, the comparator 546 is part of a priority circuit 548 which includes the master port and slave port control circuits 506, 508 for prioritizing data access requests between the master and slave ports 540, 536.

The data array 502 can be similar in structure to the data array 106 (FIG. 3A) or 106A (FIG. 3B) of the dual-port, folded-bitline DRAM architecture system 100. Therefore, similarly to system 100, the system 500 suppresses noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling by providing at least a complementary bitline on the data array 502 for each true bitline to form bitline pairs. Accordingly, every bitline pair, for example master bitline pair BLM(i), BLM(i+1) and slave bitline pair BLS(i), BLS(i+1), is coupled to its corresponding master sense amplifier circuit 518 or slave sense amplifier circuit 528. Further, each bitline and wordline are coupled to the same capacitors having approximately the same capacitance as their corresponding capacitors within data array 106. Hence, the data array 502 is similar in structure to data array 106, except for the fact that each wordline, for example all master wordlines WLM and slave wordlines WLS, are coupled to the master/slave wordline driver 512 located on one side of the data array 502.

Preferably, the system 500 includes the data array 106A having twisted bitlines, as shown by FIG. 3B, since, as described above, this data array is capable of suppressing noise due to wordline-to-bitline coupling, bitline-to-bitline coupling, and bitline-to-substrate coupling, whereas the data array 106 is only capable of suppressing noise due to wordline-to-bitline coupling and and bitline-to-substrate coupling only.

B. Maintaining Data Integrity

Since in the dual-port, shared-address bus DRAM architecture system 500 the address buses are shared, only one port can be accessed during a particular time, such as during a half cycle of a clock cycle, via a particular wordline. For example, during a first half of the clock cycle, the master port 540 is accessed and during a second half of the clock cycle, the slave port 536 is accessed. This type of access is referred to as staggered.

With reference to the waveform diagrams of FIG. 9, during the refresh mode, e.g., at a time when a burst (or distributed) refresh access occurs, the slave port 536 is not available. When the slave port 536 is not in the refresh mode, the slave port 536 becomes available for a read or write access. A read or write access can be performed by the master port 540 at any time. Hence, unlike system 100, a write access can be performed through the master and slave ports 540, 536 in system 500.

Further, just as with the dual-port, folded bitline DRAM architecture system 100, the dual-port, shared address bus DRAM architecture system 500 can also be described as behaving like an SRAM architecture system, because like the SRAM system, one port is always randomly accessible for a read or write access, while a refresh access is hidden, i.e., the refresh access can be performed at any time without requiring a specific refresh request from an external controller.

The master port and slave port control circuits 506, 508 for the dual-port, shared-address bus DRAM architecture system 500 are designed to operate according to the following five traffic rules:

1. If a DRAM cell is accessed via a wordline through the master port 540 for a read or write operation, then following the read or write operation, the DRAM cell is refreshed through the slave port 536 via the same wordline. Therefore, a simultaneous refresh request slated through the slave port 536 is canceled;

2. If the DRAM cell is accessed via two ports of the same row address wordlines for simultaneously performing write and read operations based upon a write access request slated through the master port 540 and a read access request slated through the slave port 536, it is always desirable, that the most recent data written in the DRAM cell is read out. Therefore, the data is written to the DRAM cell through the master port 540. While the data is being written to the DRAM cell, the system 500 writes the data through to the output data bus 544 via at least the I/O buffer 530. That is, as the data is received by the I/O buffer 530 via the input data bus 542, if the data is to be written to the output data bus 544, the I/O buffer 530 transmits the data to the output data bus 544 while simultaneously transmitting the data to the DRAM cell. Accordingly, the data is simultaneously written to the DRAM cell and to the data bus 544 to satisfy the write and read requests. Since the read request is satisfied, the read request slated through the slave port 536 is canceled;

3. If the DRAM cell is accessed via two ports of the same row address wordline for two simultaneous read operations, the read request slated through the slave port 536 is ignored, since one read operation is adequate;

4. A write request slated through the slave port 536 has a higher priority than a simultaneous write request slated through the master port 540, and accordingly, the write request slated through the master port 540 is canceled. This is because the write request slated through the master port 540 occurs one-half cycle before the write request slated through the slave port 536. Accordingly, the write request slated through the slave port 536 has more recent or updated data than the data corresponding to the write request slated through the master port 540. Therefore, in order to prevent an unnecessary access through the master port 540 to perform the write request (the access is deemed unnecessary, since the data will be overwritten one-half cycle later by the write request slated through the slave port 536), the write request slated through the master port 540 is canceled; and

5. A write request slated through the slave port 536 has a higher priority than a simultaneous read request slated through the master port 540 of the same row address, and accordingly, the write request slated through the master port 540 is canceled. This rule is similar to rule number two above. That is, if the DRAM cell is accessed via two ports for simultaneously performing write and read operations to the same row address, it is always desirable, that the most recent data written in the DRAM cell is read out. Therefore, the data is written to the DRAM cell through the slave port 536. While the data is being written to the DRAM cell, the system 500 writes the data through to the output data bus 544 via the I/O buffer 532. That is, as the data is received by the I/O buffer 532 via the input data bus 542, if the data is to be written to the output data bus 544, the I/O buffer 532 transmits the data to the output data bus 544 while simultaneously transmitting the data to the DRAM cell. Accordingly, the data is simultaneously written to the DRAM cell and to the output data bus 544 to satisfy the write and read requests. Since the read request is satisfied, the read request slated through the master port 540 is canceled.

The two control circuits 508, 506 for the slave and master ports 536, 540, respectively, are shown schematically by FIGS. 10 and 11. The slave port control circuit 508 has seven input nodes for receiving seven different input signals. These signals include a slave write request signal SW, a master read request signal MR, a slave read request signal SW, a refresh request signal F, a slave port control signal RAS, a master write request signal MW, and a match signal M. The slave write, master write, slave read, and master read request signals are received from an external processor, such as a central processing unit, for performing write and/or read operations. The slave port control signal RAS, as indicated above, is received from an external processor, as well.

The match signal M is provided by the comparator 546 as further described below with reference to FIG. 12. The slave port control circuit 508 also has two output nodes for transmitting a slave port output signal SS and a first write-through signal WTS1 to slave port circuitry, such as the master/slave row decoder 510, the master/slave wordline driver 512 and the I/O buffer 532, for allowing or canceling access to the slave port 536, in accordance with the logic value of the match signal M. The master/slave row decoder 510 selects one master/slave wordline driver 512 which drives one slave port wordline WLS, as known in conventional 15. DRAM circuit design.

With reference to FIG. 11, the master port control circuit 506 includes six input nodes for receiving the slave write request signal SW, the master write request signal MW, the slave read request signal SR, the master read request signal MR, the slave port controls signal RAS, and the match signal M. The master port control circuit 506 further includes two output nodes for transmitting a master port output signal MS and a second write-through signal WTS2 to master control circuitry, such as the master/slave row decoder 510, the master/slave wordline driver 512, and the master I/O buffer 530, for allowing or canceling access to the master port 540, in accordance with the logic value of the match signal M. The master/slave row decoder 510 selects one master/slave wordline driver 512 which drives one master port wordline WLM, as known in conventional DRAM circuit design.

The operation of the slave port and master port control circuits 508, 506 will now be described with continued reference to FIGS. 10 and 11, and to the priority circuit 548, as shown by FIG. 12, which includes the comparator 546, and the slave port and master port control circuits 508, 506 included within the master/slave port control circuit 504, as shown by FIG. 8.

The priority circuit 548 uses five task request signals, i.e., the slave write request signal SW, the master write request signal MW, the master read request signal MR, the slave read request signal SR, and the refresh request signal F, to determine whether the slave port output signal SS outputted by the slave port control circuit 508 and the master port output signal MS outputted by the master port control circuit 506 should be a logic high or a logic low. The slave and master port control circuits 508, 506 are designed to transmit a logic high slave port output signal SS and master port output signal MS, respectively, to master/slave port control circuitry, such as the master/slave row decoder 510 and master/slave wordline driver 512, when the slave and master row addresses inputted into the comparator 546 are different to allow access to the DRAM cell 534 through the slave port 536 and master port 540.

The slave and master row addresses are extracted from the slave and master addresses received by the master/slave prefetch address buffer 514 from external circuitry.

If the master and slave row addresses are the same, i.e., the row addresses corresponds to the same row of DRAM cells as determined by the comparator 546, then the slave port control circuit 508 is designed to transmit a logic low slave port output signal SS to the slave port circuitry to cancel access to the slave port 536, since the task requested through the master port 540 has a higher or equal priority, unless the task requested through the slave port 536 is a write access request and the task requested through the master port 536 is either a read or write access request.

In particular, if the task requested through the slave port 536 is a write access request, regardless of the task requested through the master port 540, then the slave port control circuit 508 is designed to transmit a logic high slave port output signal SS to the slave port circuitry to allow access to the DRAM cell 534 through the slave port 536. If the task requested through the slave port 536 is a write access request and the task requested through the master port 540 is a read access request, then the slave port control circuit 508 is also designed to transmit a logic high write-through signal WTS1 to simultaneously perform both requests.

Accordingly, after the comparator 546 compares the master row and slave row addresses, a logic high match signal M is transmitted to the master port and slave port control circuits 506, 508, if the row addresses are identical, i.e., correspond to the same row of DRAM cells. The slave port control signal RAS is also disabled. As a result, the slave port control circuit 508 transmits a logic low slave port output signal SS to the slave port circuitry to cancel access to the slave port 536, and hence, cancel the task scheduled to be performed through the slave port 536. Access to the master port 540 is never canceled by transmitting a logic high master port output signal MS, and hence, the task scheduled through the master port 540 is performed, unless the access request slated through the slave port 536 is a write access request.

If the access request slated through the slave port 536 is a write access request, i.e., the slave write signal SW is a logic high signal, the slave port output signal SS is a logic high signal, since the logic high slave write signal SW is directly inputted to an OR gate of the slave port circuit 508 (see FIG. 10), from which the slave port output signal SS is outputted from. Accordingly, the slave port control circuitry is enabled and access is allowed through the slave port 536. At the same time, the master port control circuitry is disabled, since a logic low master port output signal MS is transmitted to the master port control circuitry from the master port control circuit 506, as further illustrated below with reference to Table 2.

In essence, the priority circuit 548 prioritizes the tasks scheduled to be performed, where the tasks scheduled to be performed through the master port 540 are given a higher or equal priority to the tasks scheduled to be performed through the slave port 536 when the row addresses are identical, unless both tasks scheduled to be performed through the slave port 536 and the master port 540 are write accesses.

At the slave port control circuit 508, the master write request signal MW is inputted through a first inverter INV1. The output of the inverter INV1 is inputted to a first and a second AND gate, AND1, AND2. The output of the first AND gate, AND1, is inputted to a top input of the multiplexer MUX1. The slave read request signal SR is inputted to the first AND gate, AND1, a bottom input of the multiplexer MUX1, and a second inverter INV2. The output of the second inverter INV2 is inputted to the second AND gate, AND2. The refresh request signal F is inputted to the second AND gate, AND2, and a bottom input of the multiplexer MUX2. The output of the second AND gate, AND2, is inputted to a top input of the multiplexer MUX2. The master read request signal MR is inputted to an input of a third AND gate, AND3.

The output of the multiplexer MUX1 is inputted to a fourth AND gate, AND4. The output of the multiplexer MUX2 is inputted to a fifth AND gate, AND5. The slave port control signal RAS is inputted to the fourth and fifth AND gates, AND4, AND5. The output of the fourth and fifth AND gates, AND4, AND5 is inputted to the OR gate, which outputs the slave port output signal SS.

The slave write request signal SW is inputted to the OR gate and the third AND gate, AND3. The output of the third AND gate, AND3, is inputted to a top input of multiplexer MUX3. A bottom input of the multiplexer MUX3 is tied to ground and the output of the multiplexer MUX3 is the first write-through signal WTS1.

As described above, based on the logic level of the match signal M, the output of the multiplexers MUX1, MUX2, MUX3 is either the top input or the bottom input, i.e., if the match signal M is a logic high signal, the top input of the multiplexers MUX1, MUX2, MUX3 is outputted, and if the match signal M is a logic low signal, the bottom input of the multiplexers MUX1, MUX2, MUX3 is outputted. The three multiplexers MUX1, MUX2, MUX3 receive the match signal M transmitted by the comparator 546. Hence, the match signal M becomes a control signal for the multiplexers MUX1, MUX2, MUX3.

At the master port control circuit 506, the slave write request signal SW is inputted to an inverter INV. The output of the inverter INV is inputted to a top input of multiplexer MUXA. The output of the multiplexer MUXA is inputted to an AND gate, ANDA. The slave port control signal RAS is also inputted to the AND gate, ANDA.

The master write request signal MW and the master read request signal MR are inputted to an OR gate, OR. The output of the OR gate, OR, is inputted to a bottom input of the multiplexer MUXA. The master read request signal MR is also inputted to an AND gate, ANDB. The slave read request signal SR is also inputted to the AND gate, ANDB. The output of the AND gate, ANDB, is inputted to a top input of multiplexer MUXB. A bottom input of the multiplexer MUXB is tied to ground and the output of the multiplexer MUXB is the second write-through signal WTS2.

Based on the logic level of the match signal M, the output of the multiplexers MUXA, MUXB is either the top input or the bottom input. The two multiplexers MUXA, MUB receive the match signal M transmitted by the comparator 546. Hence, the match signal M becomes a control signal for the multiplexers MUXA, MUXB. If the match signal M is a logic high signal, the top input of multiplexers MUXA, MUXB is outputted by the multiplexers MUXA, MUXB. If the match signal M is a logic low signal, the bottom input of multiplexers MUXA, MUXB is outputted by the multiplexers MUXA, MUXB.

Table 2 shows the input configurations for the slave write request signal SW, the master write request signal MW, the read request signal R, and the refresh request signal F, the outputs of the multiplexers MUX1, MUX2, MUX3, MUXA, MUXB and the outputs of the slave port and master port control circuits 508, 506, i.e., the slave port output signal SS, the master port output signal MS, and the first and second write-through signals WTS1, WTS2, for a logic high a logic low match signal M.

TABLE 2 Slave write, master write, read and refresh request input configurations, the outputs of the multiplexers, and slave port and master port control circuit outputs for a logic low and a logic high match signal. MUX1 MUX2 MUX3 MUXA MUXB M SW MW MR SR F Output Output Output Output Output SS MS WTS1 WTS2 0 0 1 0 0 1 0 1 0 1 0 1 1 0 0 0 0 1 0 1 0 1 0 0 1 0 1 1 0 0 0 1 0 1 0 0 0 0 0 1 0 1 1 0 0 0 1 1 0 0 0 0 0 0 1 0 1 1 0 0 1 0 1 0 0 1 0 0 0 1 0 0 1 0 0 1 0 1 0 1 0 0 0 0 1 1 0 1 0 1 1 1 0 1 0 0 1 0 1 0 0 1 0 1 0 1 1 1 0 0 0 0 0 0 0 0 1 0 0 0

It is noted that since a slave write request, a slave read request and a refresh request cannot be simultaneously performed by the slave port 536 (see FIG. 9), if either the slave write request signal SW, the slave read request signal SR or the refresh request signal F is a logic high signal, the other two signals are logic low signals. In other words, the slave write request signal SW, the slave read request signal SR, and the refresh request signal F are mutually exclusive signals, since only one access request can be made through the slave port 536 at a particular time.

Further, it is noted that when the slave write request signal SW is a logic high signal, the master write request signal MW and the master read request signal MR cannot both be logic high signals. Further still, it is noted that when the master write request signal MW is a logic high signal, the slave write request signal SW and the slave read request signal SR cannot both be logic high signals.

Table 2 does not show any input configurations where only one of the request signals is a logic high signal, because if this occurs, there can be no address collisions and the master and slave port control circuits 506, 508 are not accessed.

Additionally, it is noted that Table 2 takes into consideration that when the match signal M is a logic low signal, the slave port control signal RAS is enabled, and when the match signal M is a logic high signal, the slave port control signal RAS is disabled.

According to Table 2, when the master and slave row addresses are not identical, the task requested through the slave port 536 and the task requested through the master port 540 are performed simultaneously by outputting a logic high slave port output signal SS and a logic high master port output signal MS.

In the case of a write-slave read request, i.e., when both a write and a read request are simultaneously requested via the same row address to be performed through the master and slave ports 540, 536, respectively, the slave port control circuit 508, shown by FIG. 10 outputs a low write-through signal WTS1, and the master port control circuit 506, shown by FIG. 11, outputs a high write-through signal WTS2. In the case of a master read-slave write request, the slave port control circuit 508 outputs a high write-through signal WTS1, and the master port control circuit 506 outputs a low write-through signal WTS2. The write-through signals WTS1, WTS2 allow the system 500 to simultaneously perform a write and read access request. This is accomplished by simultaneously writing the data to the DRAM cell 160 and writing the data through to the output data bus 544 to simultaneously perform both requests.

In conclusion, since the dual-port, shared-address bus DRAM architecture system 500 inputs the task request signals and the slave port control signal RAS within the priority circuit 548, when an address collision occurs, the system 500 maintains data integrity, while maintaining a smooth, simultaneous dual-port operation.

III. Method for Fabricating an Open-bitline, Dual-port DRAM Architecture System

A method will now be discussed in fabricating a compact open-bitline, dual-port DRAM array having a size of approximately two times smaller than an array of a single-port SRAM array. With such a small array size, the open-bitline, dual-port DRAM array would be more attractive to IC design engineers than the single-port SRAM array.

The method entails fabricating a vertical transfer gate electrode for each transfer device or transistor in the array to realize a small size for each DRAM cell in the dual-port DRAM array. FIG. 13 illustrates an open-bitline, dual-port DRAM cell designated generally by reference numeral 20 and FIG. 14 illustrates a dual-port DRAM cell array layout. As shown, by FIGS. 13-15, each DRAM cell 20 includes at least a deep-trench storage capacitor 22, two active areas 24 formed preferably of silicon, two buried-strap node contacts 32, and two gate-oxide transistor channels 26 fabricated within a semiconductor substrate 29.

One of the buried-strap node contacts 32 and one of the channels 26 are formed at the intersection of the deep-trench storage capacitor 22 and one of the two active areas 24. The other buried-strap node contact 32 and the other channel 26 are formed at the intersection of the deep-trench storage capacitor 22 and the other active area 24. Each DRAM cell 20 is also connected to bitlines BL1, BL2 via bitline contacts CB1, CB2, and to wordlines WL1, WL2.

As described below with reference to the cross-sectional view of FIG. 15 taken along line A—A of FIG. 14, two vertically-oriented access transistors 34 are formed within the substrate 29. One transistor 34 is fabricated in each side of the deep-trench storage capacitor 22. For example, as further described below, the transistor channel 26 for each transistor 34 is formed along a sidewall 28 of an upper region of the deep-trench storage capacitor 22. A description of fabricating a deep-trench-sidewall vertical access transistor for a single-port DRAM cell is provided in Radens et al., “A 0.135 μm²6F² Trench-Sidewall Vertical Device Cell for 4Gb/16Gb DRAM,” VLSI Symposium 2000, p. 39, 2000; and Gruening et al., “A Novel Trench DRAM Cell with a VERtIcal Access Transistor and BuriEd STrap (VERI BEST) for 4Gb/16Gb,” Electron Devices Meeting 1999, p. 25, 1999; the contents of which are incorporated herein by reference.

The unique array cell layout enables the addressing of the single storage, deep-trench capacitor 22 along two independent wordlines WL1, WL2 and two bitlines BL1, BL2 through bitline contacts CB1, CB2. The bitlines BL1, BL2 are preferably formed using a phase-shift mask technique or a sidewall image technique, as known in the art. The deep-trench capacitor 22 and trench-sidewall, vertically-oriented access transistors 34 are formed using conventional processing techniques, such as those described in Gruening et al. and Radens et al.

During fabrication of each open-bitline, dual-port DRAM cell 20 of the open-bitline DRAM cell array layout as shown by FIG. 14, a deep-trench DT is patterned into the substrate 29 using lithography and etching, such as reactive-ion etching (RIE), as known in the art; a thin-node capacitor dielectric 30 is preferably formed using low-pressure chemical vapor deposition (LPCVD) of silicon nitride along the sidewall 28 of the deep-trench DT. The thin-node capacitor dielectric 30 isolates the deep-trench DT from the substrate 29.

An isolation collar 31 is formed along the upper region of the deep-trench DT by techniques known in the art. The isolation collar 31 is preferably formed of silicon dioxide. The deep-trench DT is filled with a semiconductor using conventional filling techniques. The deep-trench DT is preferably filled with LPVCD As-doped polysilicon. The buried-strap node contact 32 is opened in the isolation collar 31 by wet chemical etching, such as HEF-containing solution, and LPCVD silicon is deposited and etched back, as known in the art.

A trench-top isolation dielectric 33 is deposited using high-density plasma (HDP) deposition to obtain an anisotropic deposition along the horizontal surface of the deep-trench storage capacitor 22. The gate-oxide vertical channel 26 is grown along the sidewall 28 of the upper region of the deep-trench DT using conventional oxidation techniques. A vertical transfer gate electrode 35 is formed in the upper region of the deep-trench DT, preferably by LPCVD deposition of polysilicon.

An isolation trench 36 is formed which bisects the deep-trench DT using conventional techniques, such as ligthography, RIE, deposition and chemical-mechanical planarization (CMP). The isolation trench 36 separates the gate-oxide vertical channel 26 into left and right channels 26, separates the gate electrode 35 into left and right gate electrodes 35, and separates the buried-strap node contact 32 into left and right buried-strap node contacts 32.

The left components, i.e., the left gate-oxide channel 26, the left gate electrode 35, and the left buried-strap node contact 32, are the components of one of the transistors 34 of the dual-port DRAM cell 20. The right components, i.e., the right gate-oxide channel 26, the right gate electrode 35, and the right buried-strap node contact 32, are the components of the other transistor 34 of the dual-port DRAM cell 20. The buried-strap node contacts 32 serve as one source/drain region (depending on the type of operation being performed, i.e., a write or read operation) of the gate-oxide channels 26 and are formed by the outdiffusion of As from the deep-trench DT. The other source/drain region is located at the surface of the substrate 29.

The components of the vertically-oriented access transistors 34 and the wordlines WL1, WL2 are formed using conventional deposition, lithography, RIE and ion-implantation semiconductor processing techniques. The bitline contacts CB1, CB2 are formed to contact implanted or diffused source/drain n+ regions 38 above the implanted p-well 39 in the array. One of the vertically-oriented access transistors 34 is accessed by bitline BL1 via bitline contact CB1 and the other vertically-oriented access transistor 34 is accessed by bitline BL2 via bitline contact CB2.

It is contemplated that the two vertically-oriented access transistors 34 are MOSFET transistors where the gate-oxide channels 26 are inversion channels as known in the art for MOSFET transistors.

IV. Method for Fabricating a Folded-bitline, Dual-port DRAM Architecture System

A method will now be described in fabricating various embodiments of a folded-bitline, dual-port DRAM array, as shown by FIGS. 3A and 3B, capable of suppressing noise due to wordline-to-bitline, bitline-to-bitline and bitline-to-substrate coupling. Several of the embodiments provide a more compact dual-port DRAM array than prior art dual-port DRAM arrays by using a half-pitch bitline layout. The method of fabrication is similar to the method described above for fabricating the open-bitline, dual-port DRAM architecture system.

That is the method entails fabricating a vertical transfer gate electrodes for each transfer device or transistor in the array to realize a small size for each DRAM cell in the dual-port DRAM array. A method of fabricating a folded-bitline, dual-port DRAM array of a first embodiment will now be described with reference to FIGS. 16-21. With reference to FIGS. 16 and 17, there is shown a DRAM cell designated generally by reference numeral 600. The dual-port DRAM cell 600 includes at least a deep-trench storage capacitor 604, two active areas 606 formed preferably of silicon, two buried-strap node contacts 639, and two gate-oxide vertical channels 602.

One of the buried-strap node contacts 639 and one of the channels 602 are formed at the intersection of the deep-trench storage capacitor 604 and one of the two active areas 606. The other buried-strap node contact 639 and the other channel 602 is formed at the intersection of the deep-trench storage capacitor 604 and the other active area 606. The DRAM cell 600 is also connected to bitlines 612 and 614 (BL1, BL2) via bitline contacts 616 and 618 (CB1, CB2), and to wordlines 608 and 610 (WL1, WL2).

Wordlines 608 (WL1) and 610 (WL2) are electrically connected to vertical transfer gate electrodes 611 (FIG. 17) formed in an upper region of the deep-trench capacitor 604. Bitlines 612 (BL1) and 614 (BL2) are connected to source/drain diffusion regions 615 (FIG. 17) via bitline contacts 616 (CB1) and 618 (CB2). A complementary bitline 620 (BL1′) passes over the DRAM cell 600 and another complementary bitline 622 (BL2′) runs adjacent to the DRAM cell 600.

With continued reference to the cross-sectional view of the folded-bitline, dual-port DRAM cell 600 taken along line C-C in FIG. 16, as shown by FIG. 17, and as further described below, two vertically-oriented access transistors 601 are fabricated within the substrate 630.

One transistor 601 is formed in each side of the deep-trench storage capacitor 604. For example, as further described below, the channel 602 for each transistor 601 is formed along a sidewall 638 of an upper region of the deep-trench storage capacitor 604. A description of fabricating a deep-trench-sidewall, vertically-oriented access transistor for a single-port DRAM cell is provided in Radens et al. And Gruening et al.

The DRAM cell 600 is fabricated on a semiconductor substrate 630, such as single-crystal silicon. A deep-trench DT is patterned into the substrate 630 using conventional fabrication techniques as known in the art. A thin-node capacitor dielectric 641 is formed using conventional techniques along the sidewall 638 of the deep-trench DT. The thin-node capacitor dielectric 641 isolates the deep-trench DT from the substrate 630.

A dielectric isolation collar 636 is formed along a sidewall 638 of the deep-trench DT, preferably by silicon dioxide. The deep-trench DT is filled with a semiconductor using conventional filling techniques. The deep-trench DT is preferably filled with LPCVD As-doped polysilicon. The buried-strap node contact 639 is opened in the isolation collar 636 by wet chemical etching, such as HF-containing solution, and LPCVD silicon is deposited and etched back, as known in the art.

A trench-top isolation dielectric 640 is deposited using high-density (HDP) deposition to obtain an anisotropic deposition along the horizontal surface of the deep-trench storage capacitor 604. The gate-oxide vertical channel 602 is grown along the sidewall 638 of the upper region of the deep-trench DT using conventional growing techniques. A vertical transfer gate electrode 611 is formed in the upper region of the deep-trench DT, preferably by LPCVD deposition of silicon.

An isolation trench 634 is formed which bisects the deep-trench DT, below the inter-level dielectric 632, using conventional techniques. The isolation trench 632 separates the gate-oxide vertical channel 602 into left and right channels 602, separates the gate electrode 611 into left and right gate electrodes 611, and separates the buried-strap node contact 639 into left and right buried-strap node contacts 639. An inter-level dielectric (ILD) 632 is formed between the wordlines 608 and 610.

The left components, i.e., the left transistor channel 602, the left gate electrode 611, and the left buried-strap node contact 639, are the components of one of the transistors 601 of the dual-port DRAM cell 600. The right components, i.e., the right transistor channel 602, the right gate electrode 611, and the right buried-strap node contact 639, are the components of the other transistor 601 of the dual-port DRAM cell 600. The buried-strap node contacts 639 serve as the negative source/drain regions (depending on the type of operation being performed, i.e., a write or read operation) of the gate-oxide channels 602 and are formed by the outdiffussion of As from the deep-trench DT.

The vertically-oriented access transistor elements are formed in a p-well region 642 of the substrate 630. The n+ regions 615 at the surface of the semiconductor wafer serve as source/drain diffusion regions for the transistor channels 602.

It is contemplated that the two vertically-oriented access transistors 601 are MOSFET transistors where the gate-oxide channels 602 are inversion channels as known in the art for MOSFET transistors.

The wordlines 608 and 610 are electrically connected to the transistor gate electrodes 611 formed in the upper region of the capacitor 604. The bitlines 612 and 614 are electrically connected to the source/drain diffusion regions (n+) 615 of the transistor channel 602 via the bitline contacts 616 and 618. The complementary bitline 620 passes over the DRAM cell 600. The bitlines 612 and 614 and the complementary bitline 620 are fabricated preferably by a phase-shift mask technique, a sidewall image technique, or a sidewall spacer technique, as known in the art.

The deep-trench capacitor 604, and other structures as shown by FIG. 17, may be formed by conventional semiconductor processing techniques, such as photolithography, chemical vapor 10 deposition (CVD), wet chemical etching, and dry etching. A description of a preferred fabrication process of the trench-sidewall vertical access transistor is provided in Gruening et al. and Radens et al. A description of fabricating a gate connector which should is of interest to the presently disclosed subject matter is described in U.S. Pat. No. 6,090,660 issued on Jul. 18, 2000 to a common assignee; the contents of which are incorporated herein by reference.

A method will now be described according to the present invention in fabricating the bitlines 612 and 614 connected to the DRAM cell 600 and the complementary bitline 620 passing over the DRAM cell 600 in a manner which provides for the DRAM cell 600 to use less chip area than conventional DRAM cells. FIG. 18 illustrates a cross-sectional view of the top surface of the folded-bitline, dual-port DRAM cell shown by FIG. 16 taken along line C—C in FIG. 16. The bitlines (or conductors) 612, 614, 620 and 622 are formed along at least one sidewall 646 of a patterned feature (e.g., a series of notches) 650 on the inter-level dielectric (ILD) 632. At least two bitlines 612 and 620 connected to at least the DRAM cell 600 are formed within a 2F pitch of the patterned feature 650.

For comparison purposes, a cross-sectional view of a prior art open-bitline, dual-port DRAM cell is illustrated by FIG. 19. The patterned features 660 and 662 can serve as bitlines or conductors. Only a single conductor is formed within the 2F pitch of the patterned feature 660.

The method according to the present invention in fabricating bitlines of a dual-port DRAM array, such as bitlines 612 and 620 connected to the DRAM cell 600, within a 2F pitch on the inter-level dielectric (ILD) 632 entails using conventional lithography techniques to form a series of notches 650 in the inter-level dielectric (ILD) 632. Conductive material 648 is then deposited into the notches 650 and anisotropically etched in a vertical direction to leave the conductive material 648 along the sidewalls 646 of the notches 650. The bitlines are fabricated after the components of each DRAM cell 600 and all the array wordlines are formed.

FIG. 20 is a diagram illustrating a top view of the half-pitch bitline layout of the data array having the DRAM cell 600 as shown by FIG. 16. Again, it is evident, that with the bitline fabrication method of the present invention, two bitlines are formed and provided within the 2F pitch of the patterned feature 650, i.e., the series of notches.

FIG. 21 illustrates a folded-bitline, dual-port DRAM cell array layout according to the first embodiment of the present invention designated generally by reference numeral 700. The array layout 700 includes a plurality of DRAM cells, including the DRAM cell 600 described above. Each of the DRAM cells in the array layout 700 is fabricated with the method of the present invention described above.

It can be observed from the array layout 700 that two types of DRAM cells are present: one DRAM cell which has a complementary bitline pass over the cell (e.g., DRAM cell 600) and another DRAM cell which has a bitline pass over the cell (e.g., DRAM cell 664). Both types of DRAM cells are fabricated with the method of the present invention described herein and include analogous elements. For example, unlike DRAM cell 600, the DRAM cell 664 provides that the bitline contact 616 contacts a complementary bitline instead of a true bitline and that the bitline contact 618 also contacts a complementary bitline instead of a true bitline.

FIGS. 22-26 illustrate folded-bitline, dual-port DRAM cell array layouts according to other embodiments of the present invention. Each DRAM cell of these array layouts is fabricated according to the fabrication method described above with reference to the first embodiment and includes analogous components. The layout of the DRAM cells in the arrays shown by FIGS. 22-26 varies as described below in conjunction with each one of these figures.

With reference to FIG. 22, there is illustrated a folded-bitline, dual-port DRAM cell array layout according to a second embodiment of the present invention and designated generally by reference numeral 800. In this embodiment, a true bitline, such as bitline 802, passes over each DRAM cell 804 of the array layout 800. Further, in this embodiment, a top active area 806 of each DRAM cell 804 encircles a bitline to the left of the DRAM cell and a bottom active area 810 of each DRAM cell 804 encircles a bitline to the right of the DRAM cell, as is the case with the first embodiment. In other words, in this embodiment, each DRAM cell is similar to DRAM cell 664 in the first embodiment.

With reference to FIG. 23, there is illustrated a folded-bitline, dual-port DRAM cell array layout according to a third embodiment of the present invention and designated generally by reference numeral 900. In this embodiment, a complementary bitline, such as bitline 902, passes over each DRAM cell 904 of a first column of the array layout 900 and a true bitline, such as bitline 906, passes over each DRAM cell 908 of a second column. In other words, as can be seen from FIG. 23, the type of bitline passing over each DRAM cell alternates with each column in the array layout 900.

Further, in this embodiment, in one array column, a top active area 910 of each DRAM cell encircles a bitline to the right of the DRAM cell and a bottom active area 912 of each DRAM cell encircles a bitline to the left of the DRAM cell. In an adjacent column, a top active area 914 of each DRAM cell encircles a bitline to the right of the DRAM cell and a bottom active area 916 of each DRAM cell encircles a bitline to the left of the DRAM cell.

With reference to FIG. 24, there is illustrated a folded-bitline, dual-port DRAM cell array layout according to a fourth embodiment of the present invention and designated generally by reference numeral 1000. In this embodiment, a true bitline, such as bitline 1002, passes over each DRAM cell 1004 of the array layout 1000. Further, in this embodiment, each active area 1006 of each DRAM cell 1004 is shared by two DRAM cells, i.e., one DRAM cell to the left of the active area 1006 and one DRAM cell to the right of the active area 1006, except for the top active area of each DRAM cell in the first column and the bottom active area of each DRAM cell in the last column of the array layout 1000.

FIGS. 25 and 26 illustrate open-bitline, dual-port DRAM cell array layouts according to the present invention. The array layout shown by FIG. 25 is designated generally by reference numeral 1100 and the array layout shown by FIG. 26 is designated generally by reference numeral 1200. Each DRAM cell 1102 of these array layouts 1100 and 1200 includes at least a deep-trench capacitor 1104, two active areas 1106, and two bitline contacts 1108. The array layout 1100 requires less chip area than the array layout 1200, since each bitline of the array layout 1100 is used by at least two DRAM cells 1102, whereas each bitline of the array layout 1200 is used by only one DRAM cell 1102. The DRAM cells 1102 of the array layouts 1100 and 1200 are fabricated according to the method described above with reference to FIGS. 13-15.

The embodiments illustrated by FIGS. 21-24 include bitlines formed within the 2F pitch of a patterned feature as described above with reference to FIGS. 18-20, whereas the embodiments illustrated by FIGS. 25-26 include bitlines which are not formed within the 2F pitch and hence, require more chip area. The area per unit DRAM cell required for each of the embodiments is indicated by Table 3.

TABLE 3 Area per unit required for each DRAM cell of the first through the sixth embodiments. Embodiment FIG. Area per unit DRAM cell First 21 18F² Second 22 16F² Third 23 18F² Fourth 24 12F² Fifth 25 36F² Sixth 26 48F²

It can be observed from Table 3 that the fourth embodiment, as shown by FIG. 24, has the most compact cell area. The 12F² compact cell area of the fourth embodiment is obtained by sharing bitline contacts and half-pitch bitlines by at least two dual-port DRAM cells, and by using a vertically-oriented access transistor. Further, the capacitor size of every array layout described with reference to FIGS. 21-26 is more than three times larger than that of a single-port SRAM, therefore the DRAM cell retention time is greatly improved. This enables the folded-bitline, dual-port DRAM array layouts described herein to be approximately four times smaller than the conventional single-port SRAM array layouts with equivalent or better performance.

It is contemplated that one can implement the folded-bitline, dual-port DRAM array layouts shown by FIGS. 21-24 with twisted bitlines as shown by FIGS. 3B and 3C to cancel out not only wordline-to-bitline coupling and bitline-to-substrate coupling, but also bitline-to-bitline coupling.

What has been described herein is merely illustrative of the application of the principles of the present invention. For example, the functions and methods described above as the best mode for operating and fabricating the DRAM array architectures of the present invention are for illustration purposes only. As a particular example, for instance, the array layouts described herein as being fabricated with the methods of the present invention, may be fabricated with different design layouts, other than those design layouts described herein with reference to FIGS. 21-26. Hence, other arrangements and methods may be implemented by those skilled in the art without departing from the scope and spirit of this invention. 

We claim:
 1. A memory system comprising a plurality of dynamic random access memory (DRAM) cells arranged in an array, each of the DRAM cells comprising a first port and a second port, the first port and the second port enabling access to a same DRAM cell via a first device and a second device respectively, wherein the first and second devices are oriented in a vertical direction.
 2. The memory system according to claim 1, wherein each of the DRAM cells further comprises a deep-trench having a first and second trench sidewall, wherein a portion of the first trench sidewall serves as a vertical channel for the first device and a portion of the second trench sidewall serves as a vertical channel for the second device.
 3. The memory system according to claim 2, wherein the first and second trench sidewalls serve as capacitor nodes for a deep-trench capacitor defining a shallow-trench isolation region substantially between the first and second trench sidewalls.
 4. The memory system according to claim 3, wherein the shallow-trench isolation region is configured for separating components of the first and second devices, and wherein the first and second devices are transistors.
 5. The memory system according to claim 2, wherein each of the first and second devices include a gate electrode adjacent to the portions of the trench sidewalls.
 6. The memory system according to claim 2, wherein each of the first and second devices include a source/drain region electrically coupled to the channel.
 7. The memory system according to claim 1, wherein the array is either a folded-bitline array or an open-bitline array.
 8. The memory system according to claim 1, wherein the array suppresses noise due to wordline-to-bitline, bitline-to-bitline and bitline-to-substrate coupling.
 9. The memory system according to claim 1, further comprising a first wordline electrically coupled to the first device and a second wordline electrically coupled to the second device.
 10. The memory system according to claim 1, wherein the array includes a plurality of true bitlines and complementary bitlines formed at half-pitch.
 11. The memory system according to claim 1, wherein the first device is accessed via the first port by a first bitline and the second device is accessed via the second port by a second bitline, wherein the first and second bitlines are connected to the corresponding DRAM cell via bitline contacts.
 12. The memory system according to claim 10, wherein each bitline contact is shared by at least two adjacent DRAM cells of the array.
 13. The memory system according to claim 11, wherein the bitline contacts are connected to the corresponding DRAM cell via negative source/drain regions in proximity to the first and second devices.
 14. The memory system according to claim 1, wherein at least one bitline passes over each of the DRAM cells.
 15. The memory system according to claim 14, wherein the at least one bitline is a complementary bitline.
 16. The memory system according to claim 1, wherein the size of the corresponding DRAM cell is in the range of 12F² to 18F².
 17. A semiconductor structure comprising at least two vertically-oriented transistors, and at least one deep-trench capacitor defining a shallow-trench isolation region for dividing components of the at least two vertically-oriented transistors, the at least two vertically-oriented transistors providing dual access to the at least one deep-trench capacitor, wherein a portion of the at least one deep-trench capacitor is co-extensive with at least a portion of one of the at least two vertically oriented transistors.
 18. The semiconductor structure according to claim 17, wherein the semiconductor structure is a dual-port DRAM cell.
 19. The semiconductor structure according to claim 18, wherein the size of the DRAM cell is in the range of 12F² to 18F². 